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Abstract —The family of pointillist multitarget tracking filters 
is defined to be the class of filters that is characterized by a 
joint target-measurement finite point process. The probability 
generating functional (PGFL) of the joint process is derived 
directly from the probabilistic structure of the tracking problem. 
PGFLs exempUfy the analytic combinatoric method applied to the 
measurement to target assignment problems that are fundamental 
to the tracking problem. It is shown that multi-hypothesis track¬ 
ing (MHT), joint probabilistic data association (JPDA), and many 
other now-classic tracking filters can be derived via PGFLs, and 
thus are members of the family of pointillist filters. When one or 
more of the target processes are dimensionally compatible, targets 
can be superposed. It is shown that the classic MHT filter for 
superposed targets is closely related to the multi-Bernoulli filter. 
A technique is presented for deriving the functional derivatives 
by ordinary differentiation, and both exact and approximate 
methods for evaluating the ordinary derivatives are discussed. 


Keywords—Multitarget tracking, Analytic combinatorics, As¬ 
signment problems. Finite point processes. Superposition 


I. Introduction 

Classical methods of analytic combinatorics are used to 
characterize a large number of well known recursive, discrete¬ 
time, multitarget tracking filters. Filters that can be conceptu¬ 
alized in this way are called pointillist hlters because they are 
built around hnite point process models of the joint target- 
measurement process. Finite point processes are fully and 
uniquely characterized by probability generating functionals 
(PGFLs) | |J7| and, naturally, different tracking hlters have dif¬ 
ferent PGFLs. In tracking applications, the appropriate PGFL 
can often be derived directly from the modeling assumptions 
that underlie the tracking problem. PGFLs illuminate the sim¬ 
ilarities and differences between the combinatorial problems 
that underlie the hlters. In other words, PGFLs are the basis 
of an ontology for the pointillist family of hlters. This paper 
shows not only that many well known tracking hlters are 
pointillist hlters but also that the PGFL ontology can be used 
to derive new hlters and bring to light previously unknown 
relationships between existing hlters. 

Three classes of pointillist hlters are identihed. They are 
distinguished by their use of target superposition. One class 
does not use target superposition. Individual target states are 
maintained by the hlter, and different targets can have different 
state spaces. This class contains many well-known multitarget 
tracking hlters, including Bayes-Markov p6) , MHT (multi¬ 
hypothesis tracking) |[^, PDA (probabilistic data associa¬ 
tion) 0, JPDA (joint PDA) Q, PMHT (probabilistic multi¬ 
hypothesis tracking) | [60| , IPDA (integrated PDA) pTf , and 
the JIPDA (joint IPDA) |4§ hlters. In many instances the 
recognition that these hlters can be formulated in terms of 
PGFLs is new to this paper. 


Another class does use target superposition - there is 
only one state space, shared by all targets, and separate 
target-specihc states are not maintained by the hlter. (Note 
that targets cannot be superposed if their state spaces are 
different.) Pointillist hlters that use superposition include the 
PHD (probability hypothesis density) intensity ||32|, CPHD 
(cardinalized PHD) intensity p^ , generalized PHD intensity 
GD’ and the multi-Bernoulli intensity p4) | |68| |j^ hlters. 
Other hlters in this class are superposed versions of the JPDA, 
the JIPDA, and the PMHT hlters. The multi-Bernoulli hlter is 
seen to be equivalent to the JIPDA hlter with superposition, a 
fact that has also been studied in | |72| . 

A third class of pointillist hlters is a hybrid class, in which 
targets are selectively superposed. Targets that are superposed 
have the same state space, but the other targets do not. Two 
hybridized versions of the PHD and generalized PHD hlters 
are presented. The class of selectively hybridized hlters is, to 
the knowledge of the authors, entirely new. 

The presentation of well-known, now classic, tracking 
hlters in a unifying framework sheds new light on connections 
and differences between the hlters. Moreover, it enables an 
engineer to design customized application-specihc tracking 
hlters using a palette of available point process target and 
measurement models to obtain possibly new and unique hlters, 
matched to the requirement of the problem at hand. It is one 
of the purposes of this paper to show how an engineer can 
apply in practice the presented mathematical framework to the 
design of tracking hlters. 

The paper also presents methods for computing functional 
derivatives of the PGFL. These derivatives are used to 
obtain the probability distribution of the Bayes posterior point 
process and its summary statistics p4[ , | |5^ , pT) . The method 
of secular functions | |58| is important because it reduces func¬ 
tional differentiation to ordinary differentiation. The secular 
function is an analytic function of several complex variables, 
so its derivatives can be written as a multivariate Cauchy-style 
contour integral. The saddle point method (method of station¬ 
ary phase) can be applied to approximate the contour integral 
| |59) . This strategy is fundamental to asymptotic analysis in 
analytic combinatorics 

It is the completely undisputed and lasting contribution of 
Mahler to have introduced the notion of random hnite sets into 
the target tracking community. Furthermore, he was the hrst 
who derived - among other contributions - a popular multiple 
target tracker by using PGFLs within this framework. For 
several years, however, there has been an ongoing discussion 
in the tracking community of how to formulate the underlying 
mathematical structures and to link them to the classical 
mathematical theory of hnite point processes that has been 
developed over decades. 

It is the express intent of this paper to bring different 
schools of the tracking community together by demonstrating 
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that PGELs are very precise and succinct models of the 
combinatorial probability structures involved in multitarget 
tracking, whether or not these models arise from a classical 
or a random set approach. The family of pointillist filters and 
the connections to analytic combinatorics were first proposed 
and discussed in | [55j . The first example of a tracking filter 
explicitly derived from a PGEL was the PHD intensity filter 
p2] . The first application of PGELs and the analytic combina¬ 
toric method to filters that do not employ target superposition 
was given for the PDA filter | |54| . 

The paper is organized as follows. Section [I^ discusses 
the combinatorial assignment problems that are fundamen¬ 
tal to multitarget tracking problems in the context of ana¬ 
lytic combinatorics. Section m discusses superposition and 
marginalization of target processes in terms of PGELs. General 
notation and models are presented in Section [fV| to f ix ideas, 
linear-Gaussian versions are provided in Section |IV-A| Section 


IV-B presents models for target detection, and Section IV-C 


discusses the clutter modelling. 

Section |V] (Section 0 presents the class of pointillist 
filters that do not (do) employ target superposition. Hybrid 
pointillist filters are discussed in Section VII Closing the 
Bayesian recursion and target state estimation are briefly 
discussed in Sections VIII| and IX respectively. Section 
highlights the benefits of the paper for engineers. In partic¬ 
ular a concrete non-standard example involving unresolved 
targets is presented to show how to model a tracking Alter 
using PGELs. Alternative methods for deriving functional 
derivatives—needed to derive the probability distribution and 
summary statistics of the Bayes posterior point process—are 
discussed in Section Section XII gives conclusions and 
provides a table that overviews many of the Alters discussed. 
Appendix gives a brief overview of the basic concepts 
of finite point processes. Appendix shows that a stacked 
random vector differs from a random finite set defined as a 
realization of a finite point process. 


H. Combinatorics and Tracking 

Combinatoric problems are fundamental to multitarget 
tracking involving sensors whose output measurements are 
modeled as points. These points may belong to targets, or 
they may be spurious, that is, clutter measurements. Different 
applications impose different constraints on the choices, or 
assignments, of points to targets. Assignments that satisfy the 
constraints are said to be feasible. The prototype constraint 
is widely used “at most one measurement per target rule” 
in which a target is assumed to generate at most one point 
measurement. Whatever the constraints are, the combinatoric 
problem is to model the feasible assignments in a probabilistic 
manner that supports target state estimation. 

The natural way to study combinatorial problems is to 
enumerate all feasible choices | |46l . This method is often 
impractical in large problems. Analytic combinatorics pO) 
is a classical alternative method that has proved useful in 
many kinds of counting problems (e.g., how many ways can 
a convex polygon with n + 2 sides be decomposed into n 
non-overlapping triangles?). The basic strategy comprises two 
steps. The first and often most difficult step is to discover the 
generating function (GE) that models the problem at hand. 
This step typically exploits the structure of the problem, e.g., 
a recursion, to find a closed form expression. The GE, once 


found, is an analytic function in the complex plane. The second 
step is to invoke analytic methods to study the GE to find 
expressions for the number of combinations. The behavior of 
GEs near their poles in the complex plane often determine 
asymptotic behavior 0. 

The PGEL approach is the analytic combinatoric method 
applied to the study of the multi-target tracking assignment 
problems. Specifically, GEs are replaced in tracking applica¬ 
tions by generating functionals and, most often, PGELs. 

Einding and exhibiting the PGELs that model the various 
tracking filters in the pointillist family is the Discovery Step 
of analytic combinatorics. Deriving the filter and closing the 
Bayesian recursion is the second Analytical Step. These steps 
are discussed and used in Section to aid engineers in the 
design of tracking filters. In addition to Sectio n [X| comments 
on closing the recursion are given in Section VIII and when 
discussing specific filters. Implementation details and filter 
performance are not within the scope of the paper. Interested 
readers are referred to the literature. 


III. Superposition and Marginalization of Einite 
Point Processes 

It is assumed that readers have some familiarity with finite 
point processes and PGELs in a general setting. The appendix 
of this paper gives a highly abbreviated presentation of the 
basic facts used in this paper. The authoritative texts on point 
processes are m and 0, and the fundamental paper on 
PGELs is p7| . The first applications of PGELs in a tracking 
context can be found in p4| and the papers and . More 
approachable expositions of relevant background material can 
be found in p3[ and p8] . References to specific filters are 
given where they are first discussed in technical detail. Better 
known filters are referenced to accessible textbooks wherein 
citations to the original literature can be found. 

Superposition and marginalization of finite point processes 
are fundamental concepts that are easily described in terms of 
the underlying PGEL. The case of several mutually indepen¬ 
dent processes is treated first since it is the most intuitive. 

Suppose that the PGELs of n > 1 target processes are 
specified on the spaces Si,i = 1,... ,n. These spaces need not 
be the same, so each target has its own test function. Denote 
the PGEL of process i by A>i{hi), where hi is a complex¬ 
valued test function defined on Si, that is, hi : Si —t' C. Then 
the joint PGEL of the n processes is defined by 

n 

( 1 ) 

i=l 

The product form of the joint PGEL A>{hi ,..., /i„) holds if 
and only if the processes are mutually independent. When this 
is not the case, the PGEL must be specified in such a way as 
to account for the correlation between the processes. In any 
event, realizations of the joint process are Cartesian products 
of finite point sets in the spaces Si. 

In the language of PGELs, marginalizing over a point 
process is equivalent to setting its test function identically 
equal to one. Thus, the PGEL of the marginal process is 

At,ih,) = AJ{...,l,h,,l,...). (2) 

Realizations of the marginal process are finite point sets in 
the space Si. When the processes are mutually independent. 
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the PGELs of the marginal processes are identical to the factors 
in the product Q. 

The n processes can be superposed if the spaces Si are 
copies of the same space, which is denoted by S and referred 
to as the ground space. There is only one state space so there 
is only one test function. The PGEL of the superposed process 
is given by the ’’diagonal” of the joint PGEL, i.e., 




(3) 


where the test function of the superposed process h{-) is 
defined on S. Realizations of the superposed process are finite 
point sets in S. 

The filters of Section |V] do not use superposition, and 
therefore have as many (possibly different) target state spaces 
and test functions as there are targets. The pointillist filters of 
Section VI do use superposition, and thus all targets share the 


same state space and there is only one test function. Einally, 
the hybrid pointillist filters discussed in VII superpose some 
targets and not others. The superposed targets share a state 
space but non-superposed targets do not. 


IV. Notation and Models 


In this section a consistent notation for the target and 
measurement models used in pointillist filters is presented. 
Specific probability distributions are not assumed. 

The initial reference time is denoted by to. The measure¬ 
ment sample times are denoted by k = 1,2,.... It is 
assumed that < tk for all k. The recursive time index 
is suppressed throughout for ease of presentation. 

Targets are assumed to move over time in a state space 
denoted by S, where S C Measurements are points 

in a measurement space Y C 

The general models used for pointillist filters that do not 
superpose targets are discussed in this section. To introduce the 
meaning of the notation, equivalent versions for the standard 
linear-Gaussian Kalman filter | [2^ are given in Section IV-A2| 
Point target and extended target detection models are discussed 
in Section |IV-B| Clutter modelling is described in Section 

USD 


A. Target Motion and Measurement Models 


Three of these PDEs are determined by the others: 


p(x) = / po{xo)po{x\xo) dxo 

Js 

(4) 

p{y) = j p{x)p{y\x)dx 

(5) 


(6) 


The last expression is Bayes Theorem. 


2) Linear-Gaussian Models: The general notation takes a 
familiar form for the classical linear-Gaussian Kalman filter 
J^ . The target state space is S' = and the measure¬ 
ment space is y = . The time index is included in this 

subsection. Standard Kalman filter notation is adopted. Starting 
the recursion at time let 

To{xo) = N{xo]Xk-i\k-i,Pk-i,k-i), (7) 

where Xk-i\k-i & S and Pk-i\k-i & ]^dim(S)xdim(S) ^j.g 
the specified mean vector and covariance matrix, respectively, 
at time tk-i- The target motion model is Xk = Fk-iXk-i Y 
Wk-i, where Fk-i S ]^dim(S)xdim(S) jg specified and the noise 
vector Wk is Gaussian distributed with zero-mean and known 
covariance matrix Qk-i G Rdim(S)xdim(S) 'pjjg motion 


model written in PDE form is 

p{x\xo) = J\f{x-, Fk-iXo, Qk-i)- (8) 

The predicted target PDE at time tk is 

H{x) = Af{x;Xk\k-i,Pk\k-i), (9) 

where the predicted mean and covariance matrix are given by 

^k\k — l Pk — l^k — l\k—l (lH) 

Pk\k-1 =Fk-lPk-l\k-lFk-i + Qk-1- (11) 


The measurement model is yk = HkXk + Vk, where the 
measurement matrix G ]^dim(F)xdim(S) specified and Vk 
is an additive noise vector that is zero-mean and Gaussian dis¬ 
tributed with known covariance matrix Rk G Kdim('V)xdim(Y) 
The measurement likelihood function is 

piy\x) = N{y\HkX, Rk). (12) 

The predicted measurement PDE is 


I) General Probability Models: A target prior probability 
density function (PDE) is specified at the recursion start time 
to = tk-i- Six PDEs are needed: 

• po(xo), the (prior) target PDE at time tk-i, 

• po{x\xo), the Markovian target motion (transition) 
model from xq G S' at time tk-i to x G S at tk, 

• y{x), the predicted target PDE at time tk, 

• p(yjx), the likelihood function of a measurement y G 
Y at time tk conditioned on target state x G S at 

• p{y), the PDE of a measurement at time tk condi¬ 
tioned on the sequence of all measurements up to and 
including time tk-i, 

• p(xjy), the Bayes posterior PDE at time tk conditioned 
on measurements up to time and including time tk- 


p{y) = J^{y\yk\k-i,Sk), (13) 

where the predicted measurement and its covariance matrix are 
yk\k—l PkXk\k — l (14) 

Sk = HkPk\k-lHk Y Rk- (15) 

The measurement yk G Y is given at time tk- The innovation 
(residual measurement error) at time tk is defined by tk = yk — 
yk\k-l^ so Sk G Kdim(Y)xdim(Y) j-j^g innovation covariance 
matrix. The Kalman filter output is the posterior PDE 


p{x) = Af{x;Xk\k,Pk\k), (16) 

where the updated mean and covariance matrix are 

^k\k ^k\k—l ^^k ^k ( 1 ^) 

Pk\k=Pk\k-i-WkSkW^, (18) 

Wk=Pk\k-iH^S]:\ (19) 
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The matrix Wk G ]^dim(S)xdim(F) filter gain matrix. 


B. Target Detection Modeling 

1) Targets With At Most One Point Measurement: Missed 
target detections are modeled by assuming that a target that is 
known to be present at state x G 5 at time is detected with 
probability P^{x), where 0 < Pj^{x) < 1. The probability of 
missing the target detection is then 1 — P^{x). Suppressing 
the recursion time index, for all a: G S' let 

a(a;) = 1 — P^(a;) and b{x)=P^{x). (20) 

The corresponding PGE is defined by 

G^M^[z) = a{x) + h{x)z, ( 21 ) 


widely used in diverse applications | [5^ . 

PPP clutter is considered first. Denote the intensity function 
of the clutter process at time by \k{y)- The PPP is 
homogeneous if Afc {y) = constant on Y ; otherwise, it is non- 
homogeneous. Time dependence is suppressed in the notation, 
so that Afc(y) is written \{y). The mean number of clutter 
points in Y is 

A = y X{y) dy. (26) 

To assure that A is finite (and simplify the discussion), it is 
assumed that Y is bounded. Define the clutter PDE Phiv) to 
be the normalized intensity function, 

PA[y) = X{y)/K. (27) 


where z G C. Target detection probabilities may or may 
be not be the same for all targets, depending on the application. 


2) Extended Target Model: An extended target is assumed 
to have a well defined state a; G S', e.g., an appropriately 
defined centroid. It is assumed that extended targets generate 
a random number M > 0 of independent and identically dis¬ 
tributed (point) measurements in the space Y. The distribution 
of a target-originated measurement is taken to be the likelihood 
function p{y\x). The number of measurements can also depend 
on the target state. The conditional PGP of the random variable 
M is defined by 

OO 

GmU^) = Y. Pi' (22) 

m—0 

where Pr{M = m\x} denotes the probability that m mea¬ 
surements are generated by the extended target with state 
X G S. The target is said to be detected if M > 1. Por M > 1, 
let 


dm{x) = Pr{M = ml target at state x is detected}, (23) 


so that J2m=i ^mix) = 1- Using the probabilities (201 gives 


In this notation, the PGPL of the PPP clutter process is pO) 

= exp ^ - A -b A J g{y)pA{y) dy^ , (28) 

where the test function g : Y —> C is, like all test functions, 
bounded and integrable. 

The PGPL of clutter when modeled as a cluster process is 
^ciuster(^) ^ g{y)q{y)dy^ , (29) 

where q{y) is the PDE of a clutter point y & Y and Gc{z) is 
the PGP of the number of clutter points, namely, 

OO 

Gc(^)=^Pr{C = c}z^ (30) 

c=0 


where Pr{G = cj denotes the probability that c clutter 
measurements are generated. Note that ( [28| ) is a special case 
of (29 1 for Gc{z) = exp(—A -b Az). 


V. Pointillist Pilters without Superposition 


GM\xiz) = a(x) + b{x) Y dm{x)z'^' 


m—1 


= a{x) + b{x) Goixiz), 


(24) 

(25) 


where Gd^x{z) is the PGP of the number of measurements 
generated by a detected target at x. It reduces to the 
“at most one measurement per target” model for M = 1, 
that is, to in (21 1 when di{x) = 1 and Gj)\^{z) = z. 


C. Clutter Modeling 

The clutter process (also called the false alarm process) 
in the measurement space Y is assumed to be either a non- 
homogeneous time-dependent Poisson point process (PPP) 
or a generalization called a cluster process. Por all cluster 
processes, including PPPs, given the number of points, the 
points are i.i.d. (independently and identically distributed). 
The PGP (or z-transform as it is called in signal processing 
applications) of the probability distribution of the number of 
points plays a key role. PPPs are flexible, well understood, and 


The joint PGPLs of several well-known recursive discrete¬ 
time tracking filters that do not superpose targets are exhibited 
in this section. The general notation of Section |lV] is used, and 
the recursive time index is suppressed. 


A. Bayes-Markov Filter 

1) Classical Problem: The classical Bayes-Markov filter 
|5g is presented in the following. It assumes exactly 
one target to be always present. It is always detected, and 
it generates exactly one measurement. There is no clutter, 
i.e., spurious measurements unrelated to the target. Denote the 
target and measurement test functions by /i : S' —>■ C and 
q : Y —>■ C, respectively. Both are bounded and integrable. 
The PGPL is 


4'bm(^,5)= / / h{x)g{y)p{x)p{y\x)dydx. (31) 
Js Jy 


As a check, note that 4 'bm(1,1) = g)lh(-)=i,g(-)=i = 

1, due to the fact that fj,(x) and p(jj\x) are PDPs. 
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2) With Missed Detections: Missed target detections are 
modeled using (20i l|26), 1^. The joint target-originated 
measurement process is 


^BMoiKg) = yj^ 9 {y)piy\x)dyj dx 

= j Hx)yix)(^a{x)+ b{x) J g{y)p{y\x)dy^dx. (32) 


As a check, note that 'I'bmd(I) 1) = 1- This expression shows 
up often in the sequel. It reduces to when the target 
detection probability is one. 


3) With Missed Detections and Extended Target: The PGE 
(j2^ of the number of measurements generated by the extended 
target with state x & S m assumed to be known. The PGEL 
for the extended target is p6) , fSO) , 


T'bme(^, g) 


h{x)fi{x)GM\2 


9{y)piy\x) dy^ dx. 

(33) 


If the target can generate at most one measurement, then M G 
{0,1} and GmIx(^) = o,(x) + b{x) z, z G C. Thus, (32i is a 
special case of 


where the expected number of clutter points in the gate is 

\T\=J^X{y)dy (37) 

and the clutter PDE is the intensity normalized by gate volume; 


Vr(v)-i ^(2/)/|r|, ifj/GT 
“10, ify^r. 


(38) 


Censoring target measurements lying outside E gives (cf. (32 1 ) 




gated 


BMD (Kg) 

= j h{x)g{x)(^ar{x) + br{x) J g{y)pr{y\x)dy'^dx, 


where the probability that a target at a; € S' is detected within 
the gate is 


br{x) = b{x) Pr, (39) 


so that Or (a;) = 1 — b{x)Pr is the probability that it is not 
detected in the gate, and the conditional PDE of a gated target- 
originated measurement is 


Pr{y\x) 


(Pr) Xyk), ifyer 
0, if y ^ r. 


(40) 


B. PDA Filter 


1) Without Gating: In the PDA problem Q, Q, as in the 
classical Bayes-Markov problem, exactly one target is assumed 
to be present. However, the target may or may not be detected, 
and clutter may be also present. Realizations of the clutter 
process and the target measurement process are superposed in 
the sensor measurement space Y. The target measurement and 
clutter processes are assumed to be independent. The PGEL 
of superposed independent processes is the product of the 
PGELs of the constituent process es ||49| . The target-originated 
measurement PGEL is given by (|32|l and the clutter PGEL is 
(|28|l, so the product 


n‘S'r{h,g) = 'fBMD{h,g)A>^J^{g) 

= j h{x)p{x)(^{x)+ b{x) J g{y)p{y\x)dy'^dx 

xexp^-A + Aj g{y)pA{y)dy'^. 


(34) 


is the PGEL of the superposed processes. 


Using these expressions gives 


= J h{x)p,{x)(^l-b{x)Pr + b{x) J g{y)p{y\x) dy^dx 
X exp ^ - |r| -b |A| J g{y)pA{y)dy^. (41) 


This expression holds only for y GT. The gated PGEL reduces 
to the PGEL for ungated measurements when the gate is the 
entire measurement space, i.e., E = U. 

3) Extended Target: Given the PGE ( |22) l of the number of 
measurements generated by the extended target, the PGEL for 
the ungated extended target is the product 


T'pDAE(^,y) = 'I'bme(/i,5) 

= J^h{x)p{x)GM\x(^J giy)p{y\x)dy'^dx 

X exp ^ - A -b A J g{y)pA{y) dy^ . (42) 


2) With Gating: The PGEL is modified to accommodate 
gating. The gate, denoted by E, is a specified subset of Y. The 
probability that a target-originated measurement falls within 
the gate is 

Pr = Pr{yGT} = j^p{y)dy. (35) 

The gate can be chosen arbitrarily at time tk so long as Pr{y G 
E} 7 ^ 0; however, it is typically chosen so Pr{y S E} is near 
one. Gating the PPP clutter process to E yields a PPP | |5^ 
whose PGEL is given by 

^pppgatedj.^^ = exp ^ _ |E| -b |E| g{y)pr{y) dy ^, (36) 


Gating the extended target measurements requires counting the 
number of ways that m target-originated measurements can fall 
within the gate. It is not considered further here. 

C. JPDA Filter 

Exactly n > 1 targets are assumed to be present. Clutter 
is assumed to be present. Target measurement and clutter 
processes are assumed to be mutually independent. Gating can 
be accommodated in the manner outlined below for the JIPDA 
filter in Section |V-E2| so it is not done here. 

The targets may or may not be detected. If detected, a 
target generates at most one measurement. All measurements 
generated are superposed in the same measurement space 
Y. Which measurements are target-originated and which are 
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clutter is unknown. The PGEL of the i* target measurement 
process is given by 


BMD 

Mi\x 


9 iy)p{y\x)dy] dx (43) 


^BMD(i)(^ij g) 

= / hi{x)ni{x)G 
JSi 

= J hi{x)y.i{x)(^ai{x)+h{x) J g{y)pi{y\x)dy^dx, 

where the quantities in this expression are the same as in (32 1 
but with an index i added to make them specific to the ith 
target. The state spaces Si can be different or they can be 
copies of the same space. 

Target processes are not superposed in a common state 
space. Consequently, a different test function hi{x) : 5^ —>■ K 
is necessary for each target. In contrast, the same measurement 
test function g is used because all measurements, whether 
target-originated or clutter, are superposed in the measurement 
space Y. 

Because clutter and target measurement processes are mu¬ 
tually independent, the joint PGEL is the product of PGELs; 

n 

4'jPDa(/i1, • ■ • , hn,g) = 5)j (44) 


i=l 


where is given by (28 i. 

The PGEL of the marginalprocess for the jth target, j = 
1,..., n, is found by setting hi{x) = 1 for i j. Explicitly, 


3PDA(j){dj, g) — 

n 

i=l, 


(45) 


The marginal PGELs are used for approximation purposes. 
They do not characterize the full JPDA distribution. Eor the 
now-standard derivation of the JPDA filter see 0, 0- 


This expression differs from the JPDA model (j4^ only in 
the choice of the conditional PGEL of the measurements. 
The i* target model reduces to the extended target mea¬ 
surement model when the number of measurements is 
Poisson distributed with mean Ai{x), that is, when Gm\x{z) = 
exp(-Ai(a;) + A^{x)z). 

Targets are mutually independent by assumption, so the 
joint PGEL of the PMHT filter is the product of the target- 
specific PGELs, 

n 

4'pMHt(^1, ■ ■ ■ ,hn,g) = Y\_ 'f'BMD(i)(^*’ ff)' (48) 

i=l 


As with JPDA, different targets have different test functions. 
PMHT, unlike JPDA, models clutter using a “diffuse” target, 
that is, as a target with a high measurement variance. 

The PGEL ( |48] l is, by inspection, a linear functional (see the 
discussion following ( 124| l) in the test functions hi,...,hn. 
Erom the discussion in Appendix A-2 it follows that realiza¬ 


tions of the joint target point process have, with probability 
one, exactly one target in each state space, and that the Bayes 
posterior PDE and the intensity functions are identical. 

Denote a realization hy Xi G Si, i = l,...,n. It can 
be shown that the joint posterior PDE is proportional to the 
product (over the measurements) of a probabilistic mixture of 
measurement likelihoods. The n mixing proportions are 


/ " 

TTjjxi) = Ai{xi) / Aj(a 
‘ i=i 


for i = 1, 


(49) 


This result is reasonable given the well-known relationship 
between Poisson mixtures and the multinomial distribution 
|27), 0. Details are straightforward and are omitted. 
The original PMHT filter assumes that the spatial rates are 
constant, Aiix) = Ai, and applies the EM method to find 
point estimates for the n targets and other parameters. 


D. PMHT Filter 


The PMHT filter and enhanced versions were published in 
It is assumed that there are exactly n > 1 
targets present. Target-originated measurements are modeled 
as PPPs that are conditioned on target state x G S. The mean 
number of measurements generated by the target is denoted 
by Ai{x), and the conditional likelihood is pi{y\x). The PGP 
of the target-originated measurement process conditioned 
on a target at cc S 5” is {cf. @) 

= exp ( - Ai{x) +A^{x)z). (46) 


Although not formulated in terms of PGELs, Poisson models 
for the number of measurements were first incorporated into 
the PMHT and histogram PMHT filters in p6t , | [^ . 

The PGEL of the i* target is 


4'BMD(i)(^j)5) 

= [ hi{x)fJ,^{x)G' 

Js 


PPP 

Mi\a 


giy)pi{y\x)dy] dx 


'Y 


= j hi{x)pi{x)eyip(^ - Ai{x) + Ai{x) J g{y)pi{y\x) dy'^ . 

(47) 


E. IPDA Filter 

The IPDA presented in i|4g, gg, @ 0, 0 shares 
all the assumptions of the PDA except for one — it assumes 
that at most one target exists. Let xo, 0 < xo ^ denote 
the initial probability that the target exists, i.e., that there is 
= 1 target. Let x denote the updated probability of target 
existence at current time (see ED for details). The PGP of the 
number of targets at the current time is therefore 

Gn{z) = 1-x + XZ. (50) 


The PGEL for the target-generated measurement process is 
4'lPDA(/l,ff) = GAr(T'BMD(/l, ff)) 

= 1 - A +A 4 'BMD(/i,g), ( 51 ) 

where ^'bmdC^;'?) is given by (l^i. Superposing an indepen¬ 
dent clutter process gives the PGFL for IPDA as 

4 'iPDA(ft., 5 ) = (l - A + X ^bmd(/i, g)) ^c^{g), ( 52 ) 

where the clutter PGEL is given by (28 1 . 

In 0 the IPDA filter is derived using random sets. In 
^ and further versions of the IPDA filter are presented. 
These investigations may represent another class of pointillist 
filters, but they are outside the scope of this paper. 
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F. JIPDA Filter 


1) Without Gating: The JIPDA filter was presented first 
in m (see also Q, ||44)). The JIPDA process assumes that 
at most n > 1 targets exist. Let Xi denote the existence 
probability for target i, i.e., the probability that target i is 
present. The target-originated measurement process for target 
i is, using the same notation as in (j4^, 

'I'lPDA(i) {hi, g) = 1 — Xi + Xi ^BMD(i) {hi, g) ■ (53) 

Different test functions hi are needed because the each target 
has its own state space. Assuming the target-originated mea¬ 
surement processes are mutually independent of each other and 
of the clutter process gives the PGEL for JIPDA as 

n 

^'jIPDa(^ 1 , ■ • • , hn,g) = ^'lPDA(i)(^i, d) 

i=l 

= ^c'’(5)n + (54) 

i=l ' 

As checks, note that 'I'jipda( 1, • ■ • > 1) = 1 and that 4 'jipda(') 
reduces to ^'jpda(-) for Xi = 1- 

Analogously to the JPDA, the PGEL of the marginal 
process for the jth target, j = can be determined by 

setting hi{x) = 1 for i ^ j. It is given by 


^SlPDA{j){hj,g) 

= - Xj + Xj '^BMD(j){hj,g)^ 

X f 1 ~ Xi + Xi'I'BMD(i)(l)5)V (55) 


The marginal defined in (55 1 can be used in various ways 
to approximate the joint distribution and thereby close the 
Bayesian recursion. 

In p^ , a closely related version of the JIPDA is presented 
using random sets. The versions differ only in the method used 
to start new tracks. 


2) With Gating: Gating is incorporated in a manner anal¬ 
ogous to that of Section |V-B2 for the PDA. Denote the 
measurement gate of the target by T^ C E, i = l,...,n. 
Let r = yyy^iTi. Assuming PPP clutter, the expected number 
of clutter points, |r|, in E is given by ( [j^ . The corresponding 
clutter PDE pr(2/) is given by (38i. The gated clutter process 
is a PPP, and its PGEL is 




PPPgated 
C 


{g) = exp (^ - |r| -b |r| g{y)pr{y)dy^. (56) 

Let Ppi denote the probability that a target-originated mea¬ 
surement falls within the gate of the fth target, that is 


Pr, = Pr{y G EJ = J p{y) dy. 


(57) 


Then, the PGEL of the joint target-measurement process for 
the fth target is given by 


^bmd(.)(^-5) 


= J^h{x)fi{x)(^ar,{x) + br,{x) J g{y)pri{y\x) dy^ , 

(58) 


where the probability that the ith target at a; G S'i is detected 
within the gate is 


br, (x) = b{x) Pr,, (59) 

so that ari(x) = 1 — b{x)PT^ is the probability that it is not 
detected in the gate, and the conditional PDE of a gated target- 
originated measurement is 


PT,{y\x) 


f(7^r.) ^p{y\x), ift/GE, 
lO, if y^Vi, 


(60) 


which means that each target has its own gate and its likelihood 
function is normalized, so that it is a PDE within this gate. 
Therefore, the PGEL of the Bayes posterior process of the 
gated JIPDA filter is given by 

= n (l - X. + X. («> 

i=l ^ ^ 

The PGEL of the marginal process for the JIPDA including 
gating is determined analogously to Section [V-P1| 


G. MET Filter 


In the standard Multi-Hypothesis Tracking (MHT) algo¬ 
rithm, presented in pb] , the filter generates a set of data 
association hypotheses to account for all possible origins of 
every measurement. In a specific iteration the hypothesis are 
generated, that a certain target is a false alarm, belongs to 
an existing target, or occurs due to a new target. Based on 
the target states various techniques for reducing the numerical 
complexity like gating, merging, pruning were proposed by 
the community. However, there is only one un-pruned and 
complete set of MHT hypothesis. 

Assume that the JIPDA filter derived from ( |54l l is extended 
by a process that allows data induced targets. Then, assuming 
m measurements were received it is given by 


■ j hn+m, ff) 

= 4'c'’(ff) W[^-Xi + X^ 4'bmd(*) {h^, g) 


2=1 


X f 1 Ij Ij ^BMD(j}{hn+ji g)\ i (62) 

i=i ^ ^ 


where Xj denotes the probability that measurement j was 
generated by a target that is not modeled yet, and 


4'BMD(j)(^j;5) 

= hj{x)^j{x)G^f^^ g{y)pj{y\x) dy'^ dx (63) 


is a data-driven PGEL that assumes that this new target 
has (specified) a priori PDE ^j(x) and generated a 
measurement with conditional PIDE pj{y\x). The test 
functions hn+i, ■■■,hn+j correspond to the m data-induced 
PGELs. 

The PGEL given by the data-driven JIPDA in ( |62) l captures 
the complete un-pruned set of all hypothesis of a single scan 
MHT algorithm. The correct Bayesian way to derive the target 
state estimates is to apply marginalization over all except one 
target state, exactly as it is done in the JIPDA filter. 
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To keep the numerical complexity feasible, many 
approximation methods (gating, merging) and decision 
techniques (pruning) can be applied to close the Bayesian 
recursion. These considerations are outside the scope of this 
paper. 

The close relationship between the IPDA and the MHT 
filters was first noted in | |42l . However, the connection was 
not discussed there in terms of PGELs. The connection 
between MHT, JIPDA, and multi-Bernoulli filters has been 
studied using the framework of random finite sets in | [72) . 

The study of a multiple scan MHT filter, i.e., an MHT 
filter that keeps measurement origin hypotheses for several 
successive scans, has already been studied and will be 
presented in a further publication. 


VI. Pointillist Eilters with Superposition 

In this section the PGELs of filters with target superposition 
are presented, that is all targets share the same target state 
space. 

The general notation of Section IV is used throughout, and 
the recursive time index is suppressed. 


A. Superposition in JPDA and Other Multitarget Filters 

The multitarget processes in Section [V] are such that each 
target had its own state space. When the state spaces are copies 
of the same space, then it is possible to superpose the target 
processes into one process on the common space. It is seen 
in this section that it is straightforward to superpose targets 
once the joint multitarget PGEL is known. In the following it 
is shown how three originally non-superposed filters can be 
superposed. 

One problem with superposition is the loss of target- 
specific labels. As noted in Section IX estimating target- 
specific PDEs from the superposed process requires separate 
procedures that are replete with their own special difficulties. 
Another problem, noted below in Section VIII is that 


it mis-models multiple targets. The PDE p( ) plays a more 
significant role for superposed targets than it does for one 
target. This is because the target states are (typically) i.i.d. 
samples from a random variable whose PDE is pl{-). The mis- 
modeling would disappear only if the samples could somehow 
be drawn without replacement from different targets. 


1) JPDA: The joint probability distribution in JPDA is 
marginalized over all but one of the n targets to obtain a 
target-specific PDE. When the target state spaces are all the 
same, i.e.. Si = S for all i, target point processes can be 
superposed instead of marginalized. The PGEL of the JPDA 
with superposition (JPDAS) filter is found by substituting 

hi{-) = h{-) (64) 

into the PGEL ( [44| for the JPDA. The resulting PGEL is 
4'jpDAs(/i, (?) = 4'JPDA (/i, ■ ■ ■ ,h,g) 

n 

= (65) 

i=l 

In JPDAS, as in JPDA, targets can have different motion 
models, measurement models, and detection probabilities. Note 
too that the superposition procedure can be limited so that it 
includes some targets and not others. 


2) JIPDA, PMHT, and Other Filters: Targets can be super¬ 
posed in other joint multitarget point processes with known 
PGELs when identical target state spaces are assumed. The 
PGEL of the superposed process is found by substituting ( [64| l. 
Eor example, from ( [54] ) and ( |48| l, 

'I'jiPDAs(^j g) =4* JIPDA (^, ■ ■■ ,h,g) (66) 

4'pmhts g ) =4'pmht(/i, ■ ■ ■ ^h^g) (67) 

are, respectively, the PGELs for JIPDA and PMHT with 
superposition. 


B. PHD Intensity Filter 


The probability hypothesis density (PHD) filter is proposed 
in p4) and the papers 1321 and p3j . The intensity filter (iPilter) 
is proposed in | |52| . The difference between these filters lies in 
their state space. While the PHD filter uses the state space S 
(typically S = d > 0), the iPilter uses an augmented state 
space S'^ = SU (p, where S is equal to the state space of the 
PHD filter and cp is the hypothesis space, used to model clutter 
by scatterers. By a careful choice of certain parameters the 
Bayes posterior of the iPilter on 5'+ is the same as the Bayes 
posterior of the PHD filter. Eor details see [ [501 . A detailed 
comparison of both filters using the PGEL derivation can be 
found in fS?) . 

Targets are assumed to have the same state space S and 
are superposed. It is assumed that the targets constitute a 
finite point process at the recursion start time. This is the 
prior process at the initial (start up) time, but thereafter 
it is the Bayes posterior point process. The PGEL of the 
process, denoted by A>Q{h,g), is typically not a PPP, so it is 
approximated by a PPP to close the Bayes recursion. To this 
end, the joint conditional PDE is approximated by the product 
of the marginal PDEs. Details can be found in | [34) and | [52) . 

It is important that the predicted target process be a 
PPP. This is guaranteed to be the case assuming independent 
probabilistic thinning (target death), and assuming independent 
Markovian targets having the same motion model pq{x\xo) of 
Section IV-Al It also holds if a new-target (birth) process is 


superposed, provided the birth process is a PPP independent 
of the target process. 

The details of the predicted target PPP are of little concern 
here - it suffices to assume that it is a (recursively) specified 
PPP with intensity N p{x), where the predicted number of 
targets is N = E[N], N being the random variable modeling 
the number of targets, and /i(x) is the predicted target PDE 
The mean N and PDE p,(x) are determined by details of the 
prediction process. (In particular, note that /i(x) is not given 
by 0 except in special cases.) 

Since N is Poisson distributed with mean N, its PGP is 
(when there is at most one target, see (|50ll) 


G^P(z) = 


( 68 ) 


The predicted target states are i.i.d. by PPP assumption and are 
drawn from the PDE g{x). The target measurement functions 
and detection probabilities are assumed to be the same for 
all targets. Hence, the PGEL of the predicted target-originated 
measurement process is 

g) = g)) , (69) 








Submitted to IEEE t-aes 2015003S0 on 29.05.2015 


where ^'bmd(^, 5 ) is given by (32i. The target-originated mea 


surement process is superposed with the independent clutter 
process, so the PGEL of the PHD filter is the product 

(70) 

Substituting ([28|, ([M|, and gives the explicit form 


= exp -A-N + A / g{y)pA{y)dy 


+ N J h(x)y(x)\^a{x) + b{x) J g{y)p{y\x) dy jdx j . 

(71) 


approximating process is taken equal to that of the original 
process: 

The probability distribution of 5|T conditioned on n targets 
is defined by the product approximation 

n _ 

p“l'^(si,...,s„) = (74) 

where the PDF for a single target is defined by 

(75) 


It has been noted (see, e.g. | |50) and | |5^ ) that the math¬ 
ematical form of the PGEL of the Bayes posterior process - 
before approximation to close the Bayesian recursion - is the 
product of a PPP and m Bernoulli target processes, where 
m denotes the number of measurements. In other words, the 
Bayes posterior process is the superposition of m Bernoulli 
processes and a PPP. This form is closely related to that of the 
multi-Bernoulli filters discussed below in Section IVI-EI 

In I?) the connection between a Gaussian Mixture imple¬ 
mentation of the PHD intensity filter and the JIPDA filter is 
investigated. It is shown that under certain conditions (each 
target has a linear Gaussian dynamical model, target survival 
and detection probabilities are state independent, no explicit 
target birth and spawning event) the composite density (of the 
JIPDA) conforms to the definition of probability hypothesis 
density (see ||^ Sec. 4]). 


C. CPHD Intensity Filter 


The CPHD intensity filter is proposed in p^ , p4) . It 
propagates besides the intensity additionally the cardinality 
distribution and its PGEL. The assumptions made are essen¬ 
tially the same as in the PHD intensity filter. The differences 
are that the CPHD intensity filter propagates the cardinality 
distribution \\29) , that the clutter process is given by an i.i.d. 
cluster process as defined in (29i and that the PGP of the 
number of present targets is also given by an i.i.d. cluster 
process, that is 


In words, the approximate process S|T is a cluster process 
whose PGP of target number is chosen to be equal to that of the 
posterior process, and whose points are i.i.d. distributed with 
PDP proportional to the normalized intensity of the posterior 
process. 


D. Generalized PHD Intensity Filters 

The generalized PHD intensity filter was first proposed 
in m and shares most of the assumptions of the standard 
PHD intensity filter presented in Section VI-B The predicted 
target process is a PPP and the updated target process is 
approximated by a PPP due to the same arguments as in 
Section VI-B| The differences are the target-originated mea¬ 
surement process and the clutter model. In Section VI-B| it is 
assumed that one target generates at most one measurement 
per sensor-scan. Eurthermore, it is assumed that the clutter 
model is given by ( [28] l, that is clutter is Poisson-distributed. 
The generalized PHD intensity filter relaxes these assumptions. 
Eirst, the clutter model can be chosen arbitrary. Therefore, 
let 'I'^“(p) be the PGEL of a specified, but arbitrary, clutter 
process. Eurthermore, let 

4'Imd(^:5)= f h{x)p{x)(pnmx)+ 


SsLn g{yi)Pn{yi, ■■■,yn\x) dyi...dynj dx (76) 


00 

G“(z) = ^p“(n)z", (72) 

n—0 

where is the distribution of the number of targets in 

S. The joint PGEL of the CPHD intensity filter is 

4 ^ cphd (<?, h) = G“('Pbmd(5, h)). (73) 


The PGEL of the CPHD intensity filter reduces to the PGEL 
of the PHD intensity filter if the target and measurement 
processes are both given by PPPs. 

In p5j a CPHD filter is proposed that uses a fixed number 
of targets. Note that if this CPHD filter employs a clutter 
process that is given by a PPP it is equivalent to the JPDAS 
filter proposed in ([65] l. 

The PGP (z) and the intensity /“l^(s]of the Bayes 


posterior process S|T are defined as in 


and (128 1 


with respect to the joint PGEL T'cphd( 5 j n7~To close the 
Bayesian recursion, the posterior process is approximated by 
a point process S|T. The PGP of the number of targets in the 


be the PGEL of the target measurement process, where 
Pn{yi, ■■■,yn\x) denotes the generalized symmetric likelihood 
function, which is defined on E". As mentioned in E) 
the probability of detection is defined more generally than 
for the standard PHD filter. Therefore, pn{%\x) denotes the 
probability of a missed detection and Pn{yi, ■■■,yn\x) includes 
the probability of detecting (j/i,..., S E" given a specific 
target state x G S. The PGEL of the generalized PHD intensity 
filter is therefore given by 

'I'GenPHD(^,g) = S)) ; (77) 

which was first proposed in HI)- 

E. Multi-Bernoulli Intensity Filters 

A Bernoulli random variable (trial) has two outcomes, or 
events, that are usually labeled “success” and “failure.” Its 
PGP is G®®''(z) = 1 — q qz, z G C, where q G [0,1] is 
the probability of success. A multi-Bernoulli random variable 
is defined to be the number of successes in n mutually 
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independent Bernoulli trials. If qi G [0,1] is the probability 
of success in the trial, then the PGE of the number of 
successes is the product 


n 

= n (1 - ft + ft^) • (78) 

i=l 

Bernoulli models were used in Section ^for target detection 
modeling. They are used here to model target existence. 

The multi-Bernoulli intensity filter proposed in | [34) makes 
the same assumptions as the PHD intensity filter, except that 
the prior target process is assumed to be given by a multi- 
Bernoulli process. Enhanced versions, implementations and 
numerical examples of the multi-Bernoulli intensity filter can 
be found in | |68l , | |69l . 

The predicted target process is again a multi-Bernoulli 
process. This is guaranteed by assuming the birth process to be 
a multi-Bernoulli process, which is independent of the target 
process. Target death is modeled by independent probabilistic 
thinning. Targets are assumed to be independent Markovian 
processes having the same motion model pq{x\xo) described 
in Section HV-All 

The Bayes posterior process is not a multi-Bernoulli 
process. Thus, it is approximated by a multi-Bernoulli pro¬ 
cess consisting of a superposition of two independent multi- 
Bernoulli processes, one modeling data-induced targets, the 
other modeling existing targets. This assures the Bayesian 
recursion to be closed. 

The predicted target process is a multi-Bernoulli process 
with expected number of targets N = E{N). Let n = be 
the largest integer less than or equal to N. The PGEL of the 
multi-Bernoulli target process is given by 

n 

NMB{h, g) — Xi + Xi 'I'BMD(t)(^,5)) • (79) 

i=l 


Here, Xi denotes probability that the predicted (hypothe¬ 
sized) target is indeed a target, i.e., that it exists. Analogously 


to (621, the PGEL for data-induced targets is given by 


m 

'i’MMB{h,g) = n “ 7^ + Ij ^BMD(i)(*>ff)) ■ 


(80) 


i=i 


In general a cluster process as defined in ( [2^ is proposed here 
to model clutter. Then, due to the superposition of the target- 
originated measurement process with the independent clutter 
process, the PGEL of the multi-Bernoulli filter is 


'^'“(5, h) = vI/“(5) ^NMBih,g) 'i’MMBiKg). (81) 


Substituting a PPP clutter model for the more general cluster 
process model in ( [ST) gives 

'^MB{g,h) = '^^^{g)'i’NMBih,g)'i’MMBih,g). (82) 


This is the PGEL of the multi-Bernoulli intensity filter given 
in 

Comparing the joint PGEL of the data-driven JIDPA filter 
from ( |6j| ) with the PGEL of the multi-Bernoulli filter in ( |8^ , 
it is evident that the PGELs of both filters differ only in the 
application of superposition. The data-driven JIPDA has as 
many target state spaces as there are targets and measurements. 
In contrast, the targets within the multi-Bernoulli filter all share 
the same target state space. Therefore, the multi-Bernoulli filter 


can be described as a data-driven JIDPA filter which employs 
superposition. The multi-Bernoulli filter is also described as a 


superposed single-scan MHT algorithm in Section V-G 


The connections between the multi-Bernoulli, the JIPDA, 
and the MHT filters are studied in terms of random finite sets 
in fT^ . The multi-Bernoulli filter derived in is closely 
related to the set JPDA (SJPDA) 166) filter. 


Various extensions of the multi-Bernoulli filter are pro¬ 
posed EZ) including labeled versions to keep account of the 
target identity. The labeled multi-Bernoulli process is described 
in EZ) as a random finite set on the Cartesian product of the 
state and label spaces, but this is inaccurate, for then the labels 
would be random, which they are not. As it can be seen from 
Table |XIll and the discussions above, the labeled version of the 
multi-Bernoulli filter corresponds to the JIPDA filter since the 
incorporation of target labels is equivalent to not superposing 
the targets onto one state space. 


VH. Hybrid Pointillist Eilters 

The filters presented in this section assume at most n target 
groups to be present. Therefore, each target group has its own 
state space and, within these groups, targets are superposed. 

It is also sometimes possible to superpose some targets and 
not others. Selective superposition has its uses. Eor example, 
targets that are well-separated could be superposed in one tar¬ 
get state space, thereby possibly reducing computational com¬ 
plexity without incurring significant information loss, while the 
remaining targets are not superposed. Other examples involve 
highly mixed scenarios in which some groups of targets may 
have the same “within-group” dynamical model (e.g., constant 
velocity), but with different groups having different dynamical 
models. Selective superposition procedures are an additional 
decision step, which is outside the scope of this paper. 


A. Joint PHD Intensity Filter 

The joint PHD intensity filter lEZ) assumes that exactly 
n > 1 target groups are present. The groups are mutually in¬ 
dependent. Each group has its own state space Si, i = 1 ,..., n, 
and the targets in a group generate at most one measurement. 
Analogously to the PHD intensity filter (see Section VI-B| l the 
target groups constitute a finite point process at the recursion 
start time. The PGEL of the updated process is not necessarily 
a PPP and thus it is approximated by a PPP for closing the 
Bayesian recursion. The predicted target process is a PPP, since 
independent probabilistic thinning (target death) is assumed 
per target group and independent Markovian targets of one 
target group are assumed to have the same motion model. 

The clutter process is assumed to be a PPP. Eurthermore, 
the processes of the target groups and clutter are assumed to 
be mutually independent and hence the PGEL of the joint PHD 
intensity filter is given by 


^'jointPHD(/ll, G^^(^'BMD(/i*, ff)) , 


i=l 


(83) 


where G^^{z) = e is the PGP of the target number 

of the z-th target group and Ni is the expected number of 
targets of the predicted target process. 
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B. Joint Generalized PHD Intensity Filter 

Analogously to the joint PHD intensity filter, the joint 
generalized PHD filter assumes exactly n > 1 target groups to 
be present and these groups are mutually independent. As in 
the joint PHD intensity filter each group has its own state 
space Si, i = Targets in a group are allowed to 

possess an arbitrary target-oriented measurement process, that 
is targets in a group can generate more than one measurement 
per sensor-scan. To the knowledge of the authors this filter has 
not been published yet. The target groups constitute a PPP at 
the recursion start time. Since the Bayes posterior process is 
not necessarily a PPP it is approximated by a PPP to close the 
Bayesian recursion. Due to the same arguments as in Section 
VII- A| the predicted target process is a PPP. 

As in Section IVTD] the clutter model can be chosen 
arbitrarily. Thus, the joint PGEL of the joint generalized PHD 
intensity filter is given by 

n 

^'jointGenPHD(^l, ^n,5) = H S)) j 

(84) 

where is the PGEL of the arbitrarily specified clutter 

model used in ( fTT) ) of Section |VI-D| 


VIH. Closing the Bayesian Recursion 


Eor all pointillist filters, the PGEL of the Bayes posterior 
point process can be characterized by a ratio of functional 
derivatives of the joint PGEL p^ , | |5^ , |TT|. Eor discrete 
multivariate distributions, the same kind of derivative ratio for 
the PGE of the Bayes conditional distribution has long been 
known Equ. (34.48), p. 11]. 


A Bayesian recursion is said to be closed, or exact, if the 
probability distributions of the prior and posterior processes 
have the same mathematical forms. (There is a class of filters, 
outside the scope of this paper, that is closed in this sense 
GD .) Eor the filters whose PGELs are given in this 
paper, however, it can be verified by inspection that the prior 
and posterior distributions take different forms. Consequently, 
to close the recursion the posterior point process must be 
approximated by a process having the same form as the prior. 

Methods for closing the Bayes recursion for pointillist 
filters that superpose targets have been discussed above. These 
methods make critical use of the summary statistics of the 
Bayes posterior process. One problem, already mentioned 
in Section |VI-A[ arises when the Bayes posterior process 
is approximated by a simpler process, e.g., a PPP, to close 
the Bayesian recursion. The problem is that the targets are 
i.i.d. samples of the same PDE and, intuitively speaking, 
they are drawn with replacement. Thus, even if the PDE has 
as many ’’bumps” (local maxima) as there are targets, the 
target samples can with nonzero probability all be drawn 
from the same bump. These samples should be drawn without 
replacement so that distinct targets are sampled only once. 
PPPs therefore mis-model the Bayes posterior multitarget 
state point process. 

Closing the Bayes recursion is theoretically easier for 
non-superposed filters because there are distinct state spaces 
for each possible target. When all targets are assumed to 
exist, it can be shown from the PGEL that the intensity 
function (or PDE, if the PGEL is multilinear, as discussed in 


Appendix |A-2[ ) is defined over the Cartesian product of target 
state spaces. The difficulty is that the high dimensionality 
makes it impractical to carry an estimate of the joint 
intensity. One traditional statistical approach to reducing the 
dimensionality of multivariate problems is to approximate the 
joint distributions as a product of the n univariate marginal 
distributions I®, 1^. The product form is reasonable given 
that targets are mutually independent. This approach is taken 
by JPDA 0, 0. The Bayes recursion is closed because the 
prior is assumed to have the same product form. 

Non-superposed pointillist filters encounter other issues 
when, as in JIPDA, there is at most one target in each target 
state space, i.e., when a target may or may not exist. In this 
case marginalizing over all but one target and conditioning 
on its target existence enables a conditional target state 
PDE to be found, together with the probability of target 
existence gg. Cycling through the targets one at a time gives 
a discrete-continuous marginal distribution for each target. 
Using the product approximation for the joint distribution 
closes the Bayes recursion just as in JPDA. 


IX. Target State Estimation 


The minimum Bayes risk estimate is defined for finite point 
processes as the realization of the process that minimizes the 
expected value of a specified cost function 15^ . The Bayes 
estimate yields an estimate of the number of targets and their 
states. In many problems the minimum Bayes risk estimate is 
equivalent to a maximum a posteriori (MAP) estimate for an 
appropriately defined cost function; however, this is not one of 
those problems because realizations of finite point processes 
usually have different numbers of points. 

The Bayes estimate is impractical; instead, pseudo-Map 
estimates are found using the summary statistics of the Bayes 
posterior process. The estimated number of targets, fipseudoMAP, 
is the maximum of the (discrete) posterior distribution of the 
number of targets, as determined from the PGEL (see Ap¬ 
pendix A, (129 1 -( 130l). Subsequently, fipseudoMAP target states 
estimates are obtained from the intensity function (or first 
moment, or PHD) of the Bayes posterior process. Pseudo-MAP 
surrogates are intuitively reasonable when they correspond to 
stable local peaks of the intensity function. The local peaks 
are often unstable |34), Gl)> so practical difficulties abound. 
When two or more targets coalesce into one intensity peak, it 
is not clear how to proceed. A nontrivial theoretical objection 
is that the intensity (number of targets per unit state space) 
is a summary statistic and not a full multitarget PDE, so 
the meaning of estimates obtained from it are of unknown 
statistical character. Efforts to overcome these problems have 
been investigated elsewhere (48), (3T). 

Tracking filters that do not use target superposition main¬ 
tain separate state spaces for each target. This makes the target 
state estimation problem relatively straightforward, at least 
theoretically. As discussed in the previous section a common 
way to handle large numbers of targets is to write the joint 
posterior distribution as a product of marginal distributions. 


X. How TO Design a Tracking Eilter: An 
Engineer’s Perspective 

The solution of a practical tracking problem using finite 
point processes can be separated into two methodologically 
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Fig. 1. Discovery Step. A tracking filter is fully characterized by its joint tai'get-measurement PGFL. This figure depicts the palette of available point process 
models for targets and measurement used in the PGFLs of the filters studied in sections up to and including Section It highlights the construction of the 
PGFL for the JIPDA filter. 


different steps. Eirst, the PGEL is constructed using different 
application-specific ingredients, such as the target, measure¬ 
ment and clutter model, complete, partial or no superposition 
of target states, the target-generated measurement model, the 
sensor resolution model, etc. This process is called the Dis¬ 
covery Step in Section]^ and its component steps are depicted 
in Eigure [T] Up to this point, the paper has studied many 
different hlters using this basic procedure. It can be seen from 
Eigure that several combinations with various models for 
the ingredients of a tracking filter are possible, and this leads 
to an enormous number of different possible tracking hlters. 
The enthusiastic tracking engineer can use Eigure [T] as an 
inspiration to design new customized tracking hlters that ht 
the problem of current interestj^ 

If the tracking problem is modeled by constructing the 
PGEL, the second step is to derive the formulas needed for 
the implementation of the particular tracking hlter, i.e., to 
compute the summary statistics (hrst or higher order moments, 
distribution of the number of targets, etc.) needed to close 
the Bayesian recursion. These statistics are given by ratios of 
functional derivatives with respect to the Dirac delta, evaluated 
at different points of the state and measurement spaces p4[ , 
HT), (see also Appendix 0- Eigure visualizes this 


'The situation calls to mind James Clerk Maxwell’s famous words: “There 
is nothing more practical than a good theoryT 


Analytical Step using the PHD hlter |34|, p^ , y^ , the hrst 
hlter to be derived from a PGEL. 

Alternatives to functional differentiation are discussed in 
Section XI Several of the methods presented there make it 
possible to assess particle hlter performance on simulated data 
sets quickly and reliably, that is, they provide a very cost 
effective way to explore the design space without the need 
for expensive hand-crafted code. 

The framework of PGELs and point processes used in the 
previous sections is of more than purely theoretical interest. 
Eirst, it shows similarities and differences between existing 
well-known tracking hlters. Eor example, the consideration of 
how the superposition of targets is modeled within the frame¬ 
work of point processes brings to light the close connection 
of the multi-Bernoulli, the un-pruned/merged MHT and the 
JIPDA hlter. Eurthermore, it enables the reader to understand 
better the kinds of challenges that arise if new assumptions 
are made (e.g., on target superposition. Section |IX[ ) or if 
old assumptions are altered—seemingly small changes can 
have disproportionate impact. The proposed design procedure 
therefore helps the experienced tracking engineer to understand 
existing hlters and their connections. 

Second, the framework gives birth to an entirely new class 
(to the knowledge of the authors) of tracking hlters, called 
hybrid pointillist hlters, by a straightforward application of 
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the assumption that target states are superposed only within 
a specific number of target groups. It is evident to ask for 
detailed numerical evaluations of this new class of filters; 
however, a close investigation of practical applications is not 
part of this work and will be presented in future publications. 

In addition to the important discoveries mentioned above, 
a practical working tracking engineer may still ask: What’s 
in it for me? The answer is: Unifying tracking filters in a 
common framework offers the possibility of customized de¬ 
signs for application-specific tracking problems that engineers 
are confronted with in practical work. Once understood it 
offers an easy way of finding out which existing methods 
can be re-used and which parts of the problem have to be 
modeled in a different way. Thereby, the ingredients such as 
clutter model, target-generated measurement model, etc. of the 
existing tracking filters can be mixed in a large variety of 
combinations, as depicted in Eigure[T] In the following, a guide 
to how to design a customized tracking filter is presented for 
a specific tracking scenario by making use of the essential 
cornerstone-parameters of the problem. 

Assume exactly two unresolved targets a;i S S*!, X 2 & S 2 
with probability of detection pd, i{xi,X2) and pd,2{xi,X2), 
respectively, to be present in a cluttered environment. The 
target’s identity is of particular interest and one target generates 
at most one measurement. This problem has already been 
formulated in j^) (see also Q, fTO] ) and should demonstrate 
how a tracking filter is formulated in terms of its joint PGEL. 
The following basic questions for designing a tracking filter 
are answered for each of the columns of Eigure for the 
unresolved target problem. By following these questions, the 
columns of Eigure [T] provides a guide for formulating custom 
filters for other problems. 


1 ) 


2 ) 


What is the model of target missed detections? 
Choose a standard missed detection model. Eor ease 
of discussion it is assumed that pDj{xi,X 2 ) = 
pjjj{xj), j = 1, 2, that is, the probability of detecting 
target j depends only on target j. Adopting the 
notation (201 gives aj{xj) = 1 — pD,j{xj) and 

bj{xj) =pDj{xj), j = 1,2. 


How is the sensor resolution modeled? 

The resolution of the targets depends on how close 
they appear in the sensor. Eor example, the sensor 
resolution function can be Gaussian, i.e.. 


/‘■“(xi,X2) (85) 

where the matrix E is determined by the nature of 
the sensor and the system functions Hi : Si —>■ Y, 
H 2 : S '2 —>■ U map the target states to points 
in the measurement space. Eor this choice, targets 
at xi G S 2 ,X 2 G S 2 are poorly resolved by the 
sensor if f’^^^{xi,X 2 ) ~ 1 and well resolved if 
f'^^^{xi,X 2 ) ~ 0. Whether or not unresolved target 
states are close depends on the system functions Hi 
and H 2 . The Gaussian function can be replace 
by any reasonable function /‘'®^ : Si x S 2 ^ [0,1] 
provided that f'^^^{xi,X 2 ) = 0 for all xi G Si, 
X 2 G S 2 with Hi{xi) = H 2 {x 2 )- The resolution 
function, however defined, is used in the PGE of the 
number of measurements. 


3) How to model the number of measurements? 

Let the PGE of the number of measurements be given 


by 


^''m\xi,X2 (^) = Co + CiZ -b C2Z^, 

(86) 

where 


Co =ai(xi)a2(x2), 

Cl =ai(xi) 62 (x 2 ) + bi(xi)a2ix2) 

(87) 

+ /'‘''(xi,X 2 ) 6 i(xi) 62 (x 2 ), 

(88) 

c2 =bi{xi)b 2 ix 2 ){l - f‘^"{xi,X 2 )) 

(89) 


and •) is defined, for example, by ( [85] ). The 

number of target measurements depends on the dis¬ 
tance between the two targets in the sensor space. If 
xi « X 2 , then 3 : 2 ) « 1 and the PGE of the 

number of targets is 


4) 

5) 

6 ) 
7) 


Gm\x^,x2 (^) -Co + Ciz, zGC, (90) 

which means that two poorly resolved targets yield 
at most one measurement with high probability. On 
the other hand, the PGE for well resolved targets is 
quadratic and, moreover, factors into two linear terms: 

^M|a;i,a;2(^) “ Cq -|-ClZ -|-C2Z^ 

« (ai(xi) -b bi{xi)z) {a2{x2) + b2{x2)z) 

= (91) 

Thus, well resolved detected targets yield two condi¬ 
tionally independent measurements, as expected. 
How is target existence model defined? 

Two targets are assumed present, exactly as in JPDA, 
so no modeling of the target existence is needed. 

Is target identification needed? 

Yes. Therefore the joint PGEL will have two test 
functions, one for each target. 

How is the target number modeled? 

A fixed number of targets is assumed. 

What is the target-generated measurement model? 

It is assumed that a target generates at most one 
measurement per sensor scan. (This assumption can 
be relaxed; see ( [32l l and the extended version @.) 
Measurements, however, depend jointly on the states 
of both targets. The target-generated measurement 
model can be shown to be given by a modification 
of ( |3^ . It is defined by 


4'bMd(^1) ^ 2 j g) 

= / hi{xi)pi{xi)h2{x2)p2{X2) 

JSi JS2 

xG'M|rri,a;2 9iy)Piy\Xl, X 2 ) dy^ dxidX2, 

(92) 


where p{y\xi,X 2 ) is an arbitrarily specified likeli¬ 
hood function that is conditioned on both targets. It 
is worth pointing out that the PGEL ( |9^ reduces to 
the PGEL of the standard JPDA filter for two targets 
under two assumptions: (i) the factorization 
holds, that is, the targets are always well-separated; 
and (ii) the likelihood function p{y\xi,X 2 ) depends 
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Joint PGFL 





Fig. 2. Analytical Step. Derive summary st atistic s of the respective tracking filter by differentiating the PGFL of the Bayes posterior point process. The intensity 
function / in the general case is given by jl28) . This figure depicts the variety of choices available for any PGFL, while indicating those made for the PHD 
intensity filter. Symbolic functional derivatives of the PGFL must be done by hand, but lead to explicit formulas. The secular form of the PGFL (see |99j and 

S ) can be differentiated symbolically using widely available software. Exact numerical values for particle weights can be found by AD. Derivatives of all 
s of the secular PGFL can be written using the Cauchy integral method, which lends itself to saddle point approximation. See Section pQ| 


on only one target state not two. In this case p{y\xi) 
is used within G™®. (■)> * = 1,2. 

8) Which clutter model is used? 

In this case the clutter model is an arbitrary hnite 
point process. We choose the Poisson clutter model 
^^^( 5 ) ^nd assume further that the clutter process is 
mutually independent of both target-generated mea¬ 
surement processes. 

The target-generated measurement and clutter processes 
are mutually independent, so the joint PGEL is given by the 
product of their PGELs; 

^ 2 , 5 ) = ^2, <?)• (93) 

This PGEL fully characterizes the two target example problem 
for unresolved measurements. This completes the tracking 
hlter Discovery Step. 

Sensor resolution issues increase the complexity of the 
tracking problem. In the language of PGELs, this can be traced 
to the fact that the double integral involved in the target¬ 
generated measurement model factor into a 

product of two integrals as in the perfectly resolved case, since 
the PGE of the number of measurements is not the product 
of hrst degree polynomials (cf. ((92)i. This has a significant 


impact in practice because computing double integrals is 
computationally much more demanding than computing the 
product of two lower dimensional integrals. 

Secular functions can be dehned and derived for PGELs 
with general target-generated measurement models. This can 
be seen directly for the unresolved target example using a 
method that is similar to that used in | |58) . The proof of 
the result in the general case is more involved and will be 
presented elsewhere HZ)- 

Another example of multiple integrals that do not factor 
is given by 'l'GenPHD(^, 5 ) of the generalized PHD intensity 
hlter (j77]l. The reason in this case is that the generalized PHD 
hlter enables a target to create more than one measurement 
per sensor scan and thus the integrals with respect to the 
measurement space do not factor. 

It is seen from the unresolved target example discussed 
above that in practice engineers can use the pointillist hlter 
point of view to quickly design and characterize tracking hlters 
for modeling specihc problems of interest. 

The next step is the Analytical Step, which is needed 
to obtain the explicit formulas for the implementation of 
the designed tracking hlters. In the following section and 
Appendix it is shown how summary statistics such as 
the hrst order moment (intensity function) and higher order 
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moments (for pair-correlation) of the Bayes posterior process 
can be derived from the joint target-measurement PGEL found 
in the Discovery Step. These statistics are used in filter 
implementations to close the Bayesian recursion. 

XI. Alternatives to Eunctional Differentiation 

The distribution of the Bayes posterior point process is 
given by a ratio of functional derivatives with respect to the 
Dirac delta of the joint PGELs presented in Sections [V|- VII 
Eunctional differentiation is needed to derive the summary 
statistics that are used to define the respective tracking hlters 
and close the Bayesian recursion. 

This section presents alternatives to functional differentia¬ 
tion. Secular functions, which were first presented in | [58) , are 
defined and methods for differentiating them are presented. 
Eurther details can be found in | [58t , |^|, and fTTl . 

A. Secular Functions 

In the following, the definition of a secular function from 
is presented for univariate and multivariate PGELs. Details 
are omitted and can be found in HD- 

Let vp G 7^2, where 7^2 is defined in ( |116[ ). The PGELs 
of all investigated PGELs in this work fall into this class 
of PGELs and thus all tracking filters presented here can be 
derived by the following methods. Eor the univariate case the 
secular function is defined by 


J(a;c) =limT'(/i-b a 7 ^), 


(94) 


where x G X C a G C™, c G K™, X > 0, h : X —>■ R. 
Here, {7 ^*}a>o is a family of test functions for the Dirac delta. 
Eor multivariate PGELs the secular function is defined by 


J(a; a;) =limT'^h(a;)-b ai 7 ^’(a;)J (95) 

=A> -b ^ aiS'"' (x)^ , (96) 

where uses the Dirac delta as an evaluation operator, 
not a function. The definition (95 1 can be interpreted as the 
limit of an approximation to the Dirac delta 0^ IZD- Eurther 
discussion of the definitions ( [94l i and ( |95| l can be found in pT) 
and 158). Eor x G X and a G C"*, it holds that 


= ^J{a;x] 
OX da da 


q;=0 


(97) 


and for x = {xi,..., Xm} , Xi G X, i = 1,... ,m, 
^~dxi---dx^ 


dai 


- J(a; x) 


Q;i=--- = Q:rn=0 


= J„(0;x), (98) 


which shows that the Gateaux derivative with respect to 
multiple Dirac deltas is identical to the first order mixed 
derivative of the corresponding secular function. 

Secular functions can also extended to joint PGELs (see 


|58 Sec. 4.2]). Let 'I'(p, h) be the joint PGEL of the Bayes 


Y C and S C , di,d 2 > 0. Then, the secular function 
is given by 

J{a,(3;y,x) 

( m n 

{y),^ + Y. ^77a^ (x) I (99) 

i=l j=l 


T j=l 


( 100 ) 


where (lOOi uses the Dirac delta as an evaluation operator in 
the same way as it was used in (|96]l. It holds that 


Jai3{a,(3-,y,x) 


a=0,/3=0 


- 4/y3;(p, h) 


s(-)=0. M-)=1 


( 101 ) 


The n* factorial moment (122 1 of the Bayes posterior point 
process can be computed using secular functions by 

4'ya;(0,l) 


m[„](xi,...,x„|j/) = 


vl/,(0,l) ’ 


( 102 ) 


where 

4'y;r(0,l) = 

and 

'fy{0,h)^ 


9” 


gr, 


dyi - ■ ■ dym dxi ■ ■ ■ dx, 
d' 


■T'(p,h) (103) 






(104) 


dyi--- dy. 

B. Differentiating Secular Functions 

The following sections present methods for differentiating 
secular functions. 

1) Application of Cauchy’s Residue Theorem - Saddle 
Point Methods: Consider the complex function 

a I—>■ J(a; x) = lim'k j h{x) + ai'y‘f{x) ] , (105) 

where x G K", n > 0, a G C”^, T' G 7^2 and h G FL. Then, 
J(-;x) is analytic in some open region of the complex plane 
H C C™ containing (0, ...,0) G C"* |37 Sec. 4]. Denote 
by Hi,..., Hm C C the projections of the open region H to 


from C™ to C. Thus, Cauchy's coefficient formula |[^ p.236] 
can be applied to a i—J(a;x), x G K". Hence, the ordinary 
derivative of J(-;x) is given by 


dai ■ ■ ■ dur 

1 


- J(a; x) 


Q:i = -" = Q;^=0 

J(C;x) 


(27ri)™ Jc[' Jc^ («! - Ci)^ • ■ • (am - Cm) 


posterior process, where g and h are bounded functions on 


2 t^Cl ' ' * I^Cmj 
(106) 

where Ci,...,Cm in C are simple loops of 
encircling 0 G C, respectively. Due to the concept of secular 
functions and Cauchy’s coefficient theorem (which is an impli¬ 
cation of Cauchy’s residue theorem), the first factorial moment 
can be determined by contour integration. Eurthermore, note 
that the numerical approximation of contour integral with 
powerful saddle point methods has been extensively studied 
(se e PI , p0| ) and can be used to evaluate ( 106| l numerically. 
In ]59[ saddle point approximations for the JPDA hlter are 
investigated for SMC implementations. 
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2 ) Autojnatic Differentiation: Automatic differentiation 
(AD) methods are presented in | |2^ . These techniques compute 
numerical values of derivatives of a function without finding 
the symbolic derivative. AD methods are not approximations 
but exact (up to machine precision). Since the (ordinary) 
derivatives of secular functions are identical to the functional 
derivatives of PGELs, the method of AD can be used for 
evaluating the first factorial moment at any given point/particle, 
e.g., to determine exact particle weights. 

3) Classical Finite Differences and Maclaurin Series Ex¬ 
pansion: Since the first factorial moment in | [5^ Sec. 5] is 
given by the ratio of ordinary derivatives evaluated at zero, 
it can be determined using classical approximation techniques 
like classical finite differences or a Maclaurin series expansion. 
The details on these methods can be found in fSB) . 

XII. Conclusions And Euture Work 

The ontological perspective of analytic combinatorics is 
new. It enables the relationships between known and new 
filters to be described succinctly in Table |XII| As can be seen 
from the table, the unified analytic combinatoric perspective 
has significant added value - it shows that target superposition 
is a highly flexible tool that can be used in many ways. 
The discovery of the new family of hybrid pointillist filters 
demonstrates that the new perspective is much more than a 
framework for comparing filter combinatorics. 

The pointillist family of tracking filters are those that 
can be derived via the joint PGELs of target states and 
measurements, when the states and measurements are 
modeled by finite point processes. Three classes of pointillist 
filters are proposed. The best known class comprises filters 
that do not employ target superposition, that is, each target 
has its own state space. Such filters include the classical 
Bayes-Markov filter as well as the PDA, JPDA, IPDA, JIPDA, 
PMHT, and MHT filters. 

A newer class of pointillist filters are those that use 
target superposition, that is, all targets share the same state 
space. Examples of filters in this class are the PHD intensity, 
the CPHD intensity, the generalized PHD intensity, and the 
multi-Bernoulli filters. It is shown that some of these filters 
(multi-Bernoulli, CPHD with fixed number of targets) can 
be formulated and derived from superposed versions of the 
joint PGELs of the non-superposed JPDA, JIPDA, and MHT 
tracking filters. 

Between these two main classes of filters is the class of 
hybrid pointillist filters, in which some of the targets are 
superposed and others not. Examples of this hybrid class 
presented in this paper are the joint PHD intensity and the 
generalized joint PHD intensity filters. 

An alternative to functional differentiation of the PGEL is 
presented. This technique replaces the PGEL with an ordinary, 
or secular, function and functional derivatives by ordinary 
derivatives. The two methods are equivalent theoretically; 
however, secular functions can be differentiated in many 
ways. Several are proposed in this paper, including a novel 
method involving the Cauchy residue theorem of classical 
complex analysis. 

Euture work will investigate the multi-scan versions of 
the single sensor pointillist filters discussed in this paper, 
including the multi-scan MHT, as well as multi-sensor 
versions of these filters. Some of these PGELs have already 


been derived and will be reported elsewhere. 
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Appendix A 

Einite Point Process Basics 

The results given in this appendix, together with supporting 
technical details, can be found (in different notations and 
naming conventions) in fig, |Tg, 1^, 1^, ig, and 
the references cited therein. The latter two reTarences, and 
the extensive literature cited in them, discuss a closely related 
version of the theory of finite point processes called finite set 
statistics (EISST) and random finite sets (RESs). 

Let A" be a separable metric space. A typical choice for 
X is d > 0, which is sufficient for the most applications 
appearing in target tracking. The space of sets of points or 
event space in X is defined by 

Sx X{n), (107) 

n>l 

where X{n) is the space of sets of size n S N, that is 

X{n) := {{xi, ...,x„}|xi € X,i=l, . (108) 

It is assumed that each element (p G £x\^ is, locally finite, i.e., 
each bounded subset of X can contain only a finite number of 
points of ip, and is simple, i.e., 

Vxi, Xj G p, Xi = Xj ^ i = j. (109) 

In physics (thermodynamics), £x is called the grand canonical 
ensemble, X (n) is called the n* canonical ensemble and n is 
called the canonical number. In terms of target tracking this 
translates to the assumptions that only finitely many targets 
can be present in a scenario and that no two targets share the 
same state. In tracking the number of targets is the canonical 
number and is usually referred to as the cardinal number. 

The definition of the grand canonical ensemble must be 
modified to accommodate applications in which the state space 
X is discrete or continuous-discrete. These kinds of spaces 
arise occasionally in tracking; for example, see 1^, j^, and 
15^ . In such spaces the discrete elements of X are allowed to 
be repeated, i.e., the sets in the grand canonical ensemble are 
allowed to be multisets. 

A stochastic process in the sense of is defined as a 
measurable mapping 

^■.{n,F,V)^{£x,B{£x)), ( 110 ) 

where P) is an arbitrary probability space and B{£x) 

denotes the Borel tr-algebra of £x- Let denote the push- 
forward (image) measure of P, using the point process $. Eor 
any bounded and Lebesgue-integrable function 

h:{X,B{X))-t{R,B{R)) (111) 
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Filter Name 

PGFL Notation 

Joint Target-Measurement PGFL 

Ref. 

Eqn. 

Target 

Model 

Clutter 

Missed 

Det. 

Super- 

pos. 

Bayes-Markov 

^bm(^i 9) 

Is Iy h{x)g{y)ii{x)p(y\x) dy dx 


31 


= 1 pt- tgt. 

No 

No 

No 

Missed Detections 

^bmd(^, 9) 

Is Hx)Kx) (Jy giy}p(y\x) dy) dx 


!! 


= 1 pt. tgt. 

No 

Std. 

No 

Missed Detections 

^bme(^, 9) 

/g h(x)fj,(x) Gm-|x ( Iy aiy)p{y\x) dy) dx 


!! 


= 1 ext. tgt. 

No 

Gen.® 

No 

and Extended Target 










PMHT 








Without Superposition 

^PMHT(h'l , . . . ,hn, 9) 

n"=i 


48 


= n ext. tgts. 

No^ 

No* 

No 

With Superposition 

^PMHTs(/i, 9) 

n"=i 


w 


= n ext. tgts. 

No^ 

No* 

Yes 

PDA 








Without Gating 


'S’BMD{h,g) 


34 


= 1 pt. tgt. 

PPP 

Std. 

No 

With Gating 

^pda(^, 9) 



4T 


= 1 pt. tgt. 

PPP 

Std. 

No 

Extended Target 

^PDAE(/i, 9) 



1! 


= 1 ext. tgt. 

PPP 

Gen.® 

No 

JPDA 








Without Superposition 

^JPDa(/^1, . . . ,hn, 9) 

'I'c'”(9) nr=l 9 ) 


44 


= n pt. tgts. 

PPP 

Std. 

No 

With Superposition 

^JPDAs(^, 9) 

^JPDaC^i ■ ■ ■ 9) 


55 


= n pt. tgts. 

PPP 

Std. 

Yes 

IPDA 

'^lPDA(h,g) 

'I'c'’(9) (1 - X + X 'I'bmd(/i, 9)) 


52 


< 1 pt. tgt. 

PPP 

Std. 

No 

JIPDA 








Without Gating 

^JIPDA(/il , ■ ■ ■ ,hn, 9) 

'I'c’’(9) nr=l (1 - Xi + Xi 'I'BMD{i)(^i.9)) 


54 


< n pt. tgts. 

PPP 

Std. 

No 

With Gating 






< n pt.. tgts. 

PPP 

Std. 

No 

With Superposition 

^JIPDAs(/i, 9) 

^jipda(^, . . . ,h,g) 


55 


< n pt. tgts. 

PPP 

Std. 

Yes 

MHT^ 

• • • , hnJrm, p) 

'I'c''(9) nr=l (1 “ Xi + Xi 'I'BMD(i)(^i. 9)) 


62 


< n m 

PPP 

Std. 

No 



^ rijzzzl ~ 7j + I3 ’^BMD(j) , 5 )^ 




pt. tgts. 




Multi-Bernoulli 

g) 

'I'c'”(9) n"=l (1 — Xi + Xi 'I'BMD(i)(^.9)) 


82 


< n m 

PPP 

Std. 

Yes 



^ (1 “ 7j + 7i ^BMD(j) (^’ 5 )) 




pt. tgts. 




PHD 

^PHD(/i, 9) 

't&"’’(9) G™(vliBMD(/i.9)) 


70 


ppp 

PPP 

Std. 

Yes 

CPHD 

^CPHD(/i, 9 ) 

'tg“"(9)G““‘“(v|iBMD(/i,9)) 


73 


Cluster 

Cluster 

Std. 

Yes 

Generalized PHD 

^GenPHD(^, p) 

v|ig”(9)G«('P|™('‘.9)) 


77 


Gen.^ 

Gen.® 

Gen.^ 

Yes 

Joint PHD 

^JoinlPHD(/^l5 ■ ■ ■ , hn, g) 

'tg’’(9) nr=iG?^:('I'BMD(/i,.9)) 


83 


^ n gps. 

PPP 

Std.* 

Yes* 

Joint Gen. PHD 

^JointGenPHD(h'l ^ hn, g) 

>^§"(9) nr=i G™(vprMD('^..9)) 


84 


^ n gps. 

Gen.® 

Gen.*'^ 

Yes* 


TABLE I. Overview of the pointillist family of multitarget tracking filters 

I Clutter is modeled as target with high variance 2 MHT of un-pruned and complete set of hypothesis 3 Within target groups 4 Ai'bitrary target-oriented measurement process 
5 Extended target measurement model 6 General finite point process clutter model 7 General finite point process target model 8 Modeled via missing data assignment weight 
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the PGEL = 'I'^ of the point process $ is defined by 

/ n 

Y]_h{x^)P,s,{d{xi,...,Xn}) (112) 

= ^— Y]_Hxi)PniXl,---,Xn)dxi---dXn, (113) 

n>0 ^71 2—1 


where : A’" —K is the joint probability density of the 
corresponding Jannossy measure m and defined such that 


/ n\P^{d{xi,...,Xn})= / Pn{Xl,...,Xn)dxi--- dXn 
Jb Jb 

(114) 


and h, h' G P. It can be shown by the application of 
the Lebesgue Dominated Convergence Theorem |[T] that the 
Gateaux derivative with respect to the Dirac delta ( |118| l is 
well-defined for the PGELs from ( |1 16| l fTT] . 

The derivative with respect to multiple real-valued, 
bounded and integrable variations u>i,..., is defined it¬ 
eratively, that is, by applying either (|115|l or (1 18|l iteratively; 


( 120 ) 

Alternatively, the method of simultaneous perturbation can 
be chosen to define the functional derivative with respect to 


several variations |37 Eq. (4.11)]. This method is used to 
define secular functions in Section IXI-AI 


holds for all B € B{£x)- Equation (113i holds due 

to the assumed absolute continuity of P^ and the 
application of the Radon-Nikodym Theorem. Note that 
P^{d{xi,... ,Xn}) is the probability of the ordered event, 
and pn{xi,... ,Xn) dxi ■ ■ ■ dxn is the probability of the 
unordered event. 


I) Gateaux derivative with respect to the Dirac delta: Let 
4' be a PGEL defined as in ( |113| ). Then the Gateaux derivative 
of 4' with respect to the variation uj is defined by 


doj 


'^(h -f ew) — 4'(ft,) 

lim- 

e\,0 e 


(115) 


where w is a real-valued, bounded and Lebesgue-integrable 
function on X. In fTSl the Dirac delta is defined to be a func¬ 
tion which satisfies 5{x) = 0 when x ^ 0 and J S(x)dx = 1. 
Due to the property of the Lebesgue integral that it does not 
depend on changing the integrand at one point, S cannot be a 
proper function. See | [7T[ for a proper handling of Dirac delta 
as a distribution. Alternative references are | |64l and | |2T1 . Dirac 
delta evaluated at the point c is defined by S'^(x) = S{x — c). 

The class of PGELs of interest is defined by 


P2 = '1^4' : Tf —?> IR 

/ n 

Y[Hx^)pn{xi, 

n^U 1 


Xn) dxi ■ ■ ■ dXn 


(116) 


where 


T-L = {h : X ^ R\h is bounded and Lebesgue-integrable}, 

(117) 


and pn : A" —>■ K is a symmetric PDE. 

Eor a family of test functions {7 £}a>o of that is a 
family of functions being an approximate identity in the sense 
of p. 114], the functional derivative with respect to an 
impulse c G X is defined by 


where 


94- 

l)c 


94' 

(h) = lim — 
A\,o 97^ 


(h), 


94- 


\h'] = lim 

e\,0 


4'(/i-be/i') - 4'(/i) 
e 


(118) 


(119) 


2) Factorial Moments and Summary Statistics: The first 
factorial moment, or intensity function, of the point process 
is defined by the functional derivative of its PGEL 4*(ft,) with 
respect to the impulse x G X hy 

94- 

" 1 ( 1 ] (a:) =-^(1). (121) 

Eor PPPs it is straightforward to verify that the first factorial 
moment is the intensity function that parameterizes it. 

Higher order factorial moments are defined as higher order 
derivatives of 4*(ft) evaluated at h{x) = 1 | |T^ . Thus, the n* 
factorial moment is the functional derivative of 4* with respect 
to the impulses xi,..., x„ G X, that is, 

9” 4' 

m[„](xi,...,x„) = (122) 

Higher order moments are used in tracking applications Q to 
characterize the pair-correlation structure m of the Bayes 
posterior process before closing the Bayes recursion. This 
important topic is outside the scope of the present paper. 

The first factorial moment, or intensity, for multivariate 
PGELs is an extension of the definition ( |1211 i. Let the test 
functions hj : Xj —)■ K, j = 1,..., n be non-negative and 
bounded, and let x = (xi,... ,x„)^ G Xi x ■ ■ ■ x Xn- The 
first factorial moment of a multivariate PGEL 4'(fti,..., ft„) 
is defined as the mixed first-order partial derivative 

(123) 

— hn — 1 

The intensity is seen to be a multivariate function defined on 
Xi X ■ ■ ■ X Xn- The higher order (mixed) factorial moments 
TO[fci fe^](x) can be defined analogously, as the mixed partial 
derivative of order fci,..., with respect to the test functions 
hi,, ft„, respectively. 

The functional T*)-) is linear if, for all test functions h,g 
and constants a,b G C, 

4'(aft-fftg) = a4'(ft)-f64'(p). (124) 


Mb- 


94'(fti,..., ft„) 

,i](2^) = 


9x1 • • • dXr, 


It is straightforward to see that for linear functionals, the 
only nonzero probability realizations of the point process have 
exactly one point, and the PDE of this point (in continuous 
spaces X) is identical to the intensity (1211. Multivariate 
functionals are multilinear if they are linear in each test 
function separately. As in the univariate case, it is easily 
verified that, with probability one, realizations are singleton 
points (in a Cartesian product space), and the multivariate PDE 
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is identical to the intensity function. 

The PGEL of the Bayes posterior process and its intensity 
function is now derived. Let X and y be the target state 
and measurement space respectively. Typically X C and 
y C di, ^2 > 0. Let T be a finite point process with 
events in £y, and let S be a hnite point process with elements 
in Sx- Then analogously to ( |113[ ) the bivariate PGEL is defined 
on the product space SyxEx as the expectation of the product 
of random products diVi) h{xj), that is, 




tIt?,! 


m—O n—O 




W_9{yi)WKx]) 

i=i 


X PTH(t/l, . . . , t/m, Xi, . . . , CCn) dt/i • • • dyra dxi ■ ■ ■ dXn, 

(125) 


where : 3^ x T” —K is a symmetric PDE in the hrst and 
second arguments, respectively. Marginalizing with respect to 
one process yields the PGEL of the other, that is, 

d) = T'h(/i) and '1 'xh(5j 1) =(126) 


Using functional derivative with respect to impulses and apply¬ 
ing Bayes rule yields the PGEL of the Bayes posterior process 
(see details in |[53|), that is, 

dyi-dvrr, 


The ratio (127 1 for multi-target applications is given in 134 p. 


757], where it is described as the PGEL form of the “multi¬ 
target corrector.” Conditional PGEs for discrete multivariate 
distributions take exactly the same form |27 p. 11]. 

The intensity of the Bayes posterior process is often used 
as a summary statistic. Eor x G X, it is given by 


• dymOx 


( 0 , 1 ) 




(128) 


dyi -- dym 

Another summary statistic is the posterior distribution of the 
canonical (cardinal) number, which is given by 


n\ dx^ 


dx'^ 


(sPt (».!)) 


x—0 


dyi--- dym 


( 0 , 1 ) 


where 




dyi-- - dym dx 


( 0 , 1 ) 


h{-)^z 


dyi - dym 


( 0 , 1 ) 


(129) 


(130) 


z G C, denotes the PGE of the number of targets. The 
expected number of targets is given by ^F“l^(l). 


Appendix B 

Stacked Random Vectors and Random Einite Sets 

The difference between an ordered stack of n random 
vectors and a random set comprising the same n vectors is 
explained in terms of superposition. The difference comes alive 
in tracking problems because target labels are arbitrary and the 
order of the vectors in the stacked vector is irrelevant. 


Denote the n random variables by Xi and their realiza¬ 
tions by Xi G S. The stacked random vector is the n-tuple 
{xi ,..., Xn), while the random set is {xi,..., Xn}- The fact 
that the order is irrelevant is often misinterpreted to mean that 
the stacked vector and the random set are somehow equivalent. 
Careful examination shows that the random set is properly 
conceptualized as the superposition of a joint point process 
whose realizations are those of the random vector variable 
X = {Xi ,..., Xn) defined on the product space 5'". 

The difference is seen clearly by comparing PGELs. The 
stacked random vector has the joint PGEL 4/(hi,..., ) with 

n test functions, each defined on S. Realizations of the joint 
point process comprise exactly one n-tuple (with probability 
one). Its joint PGEL is 


4'(hi,...,h„) 



n 

(^]^hj(xi)^px(a:i, ...,Xn)dxi.. .dxn, 
2=1 


(131) 


where px{xi ^..., Xn) is the joint PDE of the random vector 
X. In contrast the PGEL of the random set is the diagonal of 
this joint PGEL, namely 


n. n 

'^s{h)= / (^Y\_H^t))pxi^l^---^^n)dxi---dXn. 

•^‘ 5 ’* i=l 

(132) 

It has only one test function, also defined on S. The PGELs 
are different, so the processes are different as well. 

The PDE of the stacked vector is found by functional 
differentiation of (131 1 with respect to the test functions hi, 
each with a single point mass at Xi. As expected, the derivative 
is px{xi,..., Xn)- In contrast, the PDE of the random set is 
found by functional differentiation of ( |132| ) with respect to the 
test function h with n point masses, one at each Xi- It can be 
shown that the derivative is 


p{{xi,. . . ,Xn}) = X Px{Xa(l),---,X^(n)), (133) 

crGSym(n) 


where Sym(n) is the set of permutations of (l,...,n). This 
expression is a sum over all random vectors that yield the 
random set {cci,...,x„}. It can be derived by a tedious direct 
calculation, or by an easy exercise using the Cauchy residue 
theorem | [5^ Appendix]). (Eor similar expressions related to 
measurement to target assignment problems, see ©.) 
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